Prospective and retrospective temporal difference learning.
نویسنده
چکیده
A striking recent finding is that monkeys behave maladaptively in a class of tasks in which they know that reward is going to be systematically delayed. This may be explained by a malign Pavlovian influence arising from states with low predicted values. However, by very carefully analyzing behavioral data from such tasks, La Camera and Richmond (2008) observed the additional important characteristic that subjects perform differently on states in the task that are at equal distances from the future reward, depending on what has happened in the recent past. The authors pointed out that this violates the definition of state value in the standard reinforcement learning models that are ubiquitous as accounts of operant and classical conditioned behavior; they suggested and analyzed an alternative temporal difference (TD) model in which past and future are melded. Here, we show that, in fact, a standard TD model can actually exhibit the same behavior, and that this avoids deleterious consequences for choice. At the heart of the model is the average reward per step, which acts as a baseline for measuring immediate rewards. Relatively subtle changes to this baseline occasioned by the past can markedly influence predictions and thus behavior.
منابع مشابه
Comparison of Prospective and Retrospective Memory and Attention in Patients with Chronic Low Back Pain with Healthy People
Objective: The present study aimed to compare prospective and retrospective memory impairment and attention deficit in people suffering from chronic low back pain with those cognitive functions in healthy subjects. Furthermore, this study examines the relation between severity and duration of pain and prospective and retrospective memory impairment and attention deficit. Me...
متن کاملControl of Multivariable Systems Based on Emotional Temporal Difference Learning Controller
One of the most important issues that we face in controlling delayed systems and non-minimum phase systems is to fulfill objective orientations simultaneously and in the best way possible. In this paper proposing a new method, an objective orientation is presented for controlling multi-objective systems. The principles of this method is based an emotional temporal difference learning, and has a...
متن کاملProspective and Retrospective Memory Complaints in HIV-Infected Individuals
Background and purpose: Deficits of retrospective memory (RetM) and prospective memory (ProM), which are two primary components of episodic memory, have been reported in prior studies in HIV-infected individuals. The present study aimed to further elucidate the characteristics of prospective and retrospective memory complaints in HIV-infected individuals. Materials and Methods: We recruited 50...
متن کاملLow kilovolt “prospective ECG-triggering” vs. “retrospective ECG-gating” coronary CTA: comparison of image quality and radiation dose
Background: To compare image quality and radiation doses of low kilovolt (kV) “prospective ECG-triggering” (PT) and standard “retrospective ECG-gating” (RG) coronary computed tomography (CT) angiography. Materials and Methods: A total of 101 consecutive patients (76 males, 25 females; mean age: 55.44 ± 8.28 years) with low-to-intermediate risk status for coronary artery disease and with a body ...
متن کاملAssociation between prospective, retrospective memories and job performance of nurses in the city of Ahvaz, Iran, in 2016
Background: Nurses continuously carry out several activities and need to develop certain cognitive mechanisms to be able to concentrate on their tasks. These activities and tasks affect and deal with the health and safety of people and their job performance. This study aims to investigate the association between prospective and retrospective memories and job performance of nurses in Imam Hospit...
متن کاملHandedness differences observed in episodic memory retrieval do not extend to the domain of prospective memory.
A robust inconsistent handed advantage exists for episodic retrospective memory processes. The current study was undertaken to test whether this handedness difference extends to the domain of prospective memory (PM). Two studies, one based on a self-report measure (the Prospective and Retrospective Memory Questionnaire) and the second involving a performance-based test (Memory for Intentions Sc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Network
دوره 20 1 شماره
صفحات -
تاریخ انتشار 2009